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Period for Reply 



^^^"SHORTENED^S'TAT - 

THE MAILING DATE OF THIS COMMUNICATION. 

- Extensions of time may be available under the provisions of 37 CFR 1 .136(a). In no event, however, may a reply be timely filed 
after SIX (6) MONTHS from the mailing date of this communication. 

- If the period for reply specified above is less than thirty (30) days, a reply within the statutory minimum of thirty (30) days will be considered timely. 

- If NO period for reply is specified above, the maximum statutory period will apply and will expire SIX (6) MONTHS from the mailing date of this communication. 

- Failure to reply within the set or extended period for reply will, by statute, cause the application to become ABANDONED (35 U.S.C. § 133). 
Any reply received by the Office later than three months after the mailing date of this communication, even if timely filed, may reduce any 
earned patent term adjustment. See 37 CFR 1.704(b). 

Status 

1 )M Responsive to communication(s) filed on 01 July 2004 . 
2a)[x] This action is FINAL. 2b)D This action is non-final. 

3) D Since this application is in condition for allowance except for formal matters, prosecution as to the merits is 

closed in accordance with the practice under Ex parte Quay/e, 1935 CD. 11, 453 O.G. 213. 

Disposition of Claims 

4) H Claim(s) 1-12 is/are pending in the application. 

4a) Of the above claim(s) is/are withdrawn from consideration. 

5) D Claim(s) is/are allowed. 

6) M Claim(s) 1^12 is/are rejected. 

7) D Claim(s) is/are objected to. 

8) D Claim(s) are subject to restriction and/or election requirement. 

Application Papers 

9) D The specification is objected to by the Examiner. 

10)D The drawing(s) filed on is/are: a)D accepted or b)D objected to by the Examiner. 

Applicant may not request that any objection to the drawing(s) be held in abeyance. See 37 CFR 1.85(a). 

Replacement drawing sheet(s) including the correction is required if the drawing(s) is objected to. See 37 CFR 1.121(d). 
1 1 )□ The oath or declaration is objected to by the Examiner. Note the attached Office Action or form PTO-1 52. 
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1 .13 Certified copies of the priority documents have been received. 

2. D Certified copies of the priority documents have been received in Application No. . 

3. D Copies of the certified copies of the priority documents have been received in this National Stage 

application from the International Bureau (PCT Rule 17.2(a)). 
* See the attached detailed Office action for a list of the certified copies not received. 
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DETAILED ACTION 



Specification 

1 . This action is responsive to communication filed on July 01 , 2004. This 
amendment has been entered and carefully considered. Claims 1.-12 are again 
presented for examination. 

2. The certified copy of the foreign priority document has been received. 

3. The rejection of claims 1-12 as in the Office Action mailed April 02, 2004 is 
respectfully maintained and reiterated below for Applicant's convenience. 

Claim Rejections - 35 USC § 103 

The following is a quotation of 35 U.S.C. 103(a) which forms the basis for 
all obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described 
as set forth in section 1 02 of this title, if the differences between the subject matter sought to 
be patented and the prior art are such that the subject matter as a whole would have been 
obvious at the time the invention was made to a person having ordinary skill in the art to which 
the subject matter pertains. Patentability shall not be negatived by the manner in which the 
invention was made. 

4. Claims 1, 11 and 12 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Gulley et al. (USPN: 5,025,407) hereinafter, Gulley in view of 
Messina etal. (USPN: 4,317,168) hereinafter, Messina. 

As per claim 1 , Gulley teaches a data processing apparatus comprising a 
main processor (the graphics processor 120 in Fig. 1) responsive to program 
instructions to perform data processing operations and a coprocessor (the 
floating point coprocessor 1200 in Fig. 1) coupled to the main processor and 
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responsive to a coprocessor load instruction (an instruction) on the main 
processor to load one or more loaded data words into the coprocessor and 
perform at least one coprocessor processing operation specified by the 
coprocessor load instruction using the one or more loaded data words to provide 
operand data to generate at least one result data word (e.g. see Col. 2, lines 3- 
10 and Fig. 1). However, Gulley does not teach that the number of loaded data 
words loaded into the coprocessor is depended upon whether or not the start 
address of the operand data is aligned with a word boundary. 

Messina, on the other hand, teaches that the number of loaded data 
words (the quad words, QW) loaded (for the line fetch, LF) is depended upon the 
operand data alignment within the word boundary, i.e. 8 or 9 quad words (QW) 
occur for a line fetch (LF) depending upon the double word (DW) boundary 
alignment (e.g. see Abstract). Accordingly, it would have been obvious to one of 
ordinary skill in the art at the time of the current invention was made to implement 
the step of deciding whether to load one or more loaded data words based on the 
operand data alignment within the word boundary as taught by Messina in 
Gulley' s apparatus. In doing so, the coprocessor load instruction gets the 
required number of operands and can start the execution of the load instruction 
without waiting for the remaining operands. Therefore, the number of clock 
cycles required for the execution of the coprocessor load instruction is reduced. 

As per claims 1 1 and 12, Gulley teaches a method of processing data and 
a computer program product for controlling a computer comprising the steps of: 
in response to program instructions performing data processing operations in a 
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main processor (the graphics processor 120 in Fig. 1) and in response to a 
coprocessor load instruction i_(an i_ instruction) on the mair^ processor loading one 
or more loaded data words into a coprocessor (the floating point coprocessor 
1200 in Fig. 1) coupled to the main processor and performing at least one 
coprocessor processing operation specified by the coprocessor load instruction 
using the one or more loaded data words to provide operand data to generate at 
least one result data word (e.g. see Col. 2, lines 3-10). However, Gulley does 
not teach that the number of loaded data words loaded into the coprocessor is 
depended upon whether or not the start address of the operand data is aligned 
with a word boundary. 

Messina, on the other hand, teaches that the number of loaded data 
words (the quad words, QW) loaded (for the line fetch, LF) is depended upon the 
operand data alignment within the word boundary, i.e. 8 or 9 quad words (QW) 
occurred for a line fetch (LF) depending upon the double word (DW) boundary 
alignment (e.g. see Abstract). Accordingly, it would have been obvious to one of 
ordinary skill in the art at the time of the current invention was made to implement 
the step of deciding whether to load one or more loaded data words based on the 
operand data alignment within the word boundary as taught by Messina in 
Gulley's method and computer program. In doing so, the coprocessor load 
instruction gets the required number of operands and can start the execution of 
the load instruction without waiting for the remaining operands. Therefore, the 
number of clock cycles required for the execution of the coprocessor load 
instruction is reduced. 
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5. Claims 2-10 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Gulley in view of Messina further in view of York et al. (USPN: 6,002,881) 
hereinafter, York. 

As per claim 2, the combination of Gulley and Messina teaches the 
claimed invention as described above. However, both Gulley and Messina failed 
to teach that the coprocessor includes a coprocessor memory for storing one or 
more locally stored data words used as operands in the at least one coprocessor 
processing operation in combination with the one or more loaded data words. 
York, on the other hand, teaches that the coprocessor (Piccono coprocessor 4 in 
Fig. 1) includes a coprocessor memory (registers 10 in Fig. 2) for storing one or 
more data words, which includes data words used as operands and loaded data 
words (emphasis added) (e.g. see Figs. 1-2 and Col. 5, lines 44-57). 
Accordingly, it would have been obvious to one of ordinary skill in the art at the 
time of the current invention was made to implement the coprocessor memory in 
the coprocessor for storing locally stored data words along with the loaded data 
words as taught by York in Gulley and Messina's data processing apparatus. In 
doing so, the coprocessor retrieves these data words faster than storing it 
elsewhere (not locally to the coprocessor), which reduces the data latency and 
therefore, the performance of the coprocessor increases. 

As per claim 3, the combination of Gulley and Messina teaches the 
claimed invention as described above. However, both Gulley and Messina failed 
to teach that the data processing apparatus comprising a memory coupled to the 
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main processor and wherein the main processor is configured to retrieve the one 

or more loaded datajwords from the memory to the coprocessor via the main 

processor without being stored within registers within the main processor. York, 
on the other hand, teaches a memory coupled to the main processor and 
wherein the one or more loaded data words are retrieved from the memory to the 
coprocessor via the main processor without being stored within registers within 
the main processor (e.g. see Col. 1, lines 18-34). Accordingly, it would have 
been obvious to one of ordinary skill in the art at the time of the current invention 
was made to modify the data processing apparatus taught by Gulley and 
Messina as such so the loaded data words can be retrieved from the memory to 
the coprocessor via the main processor without being stored within registers 
within the main processor as taught by York. In doing so, the data retrieval time 
reduces and therefore, the overall performance of the data processing apparatus 
increases. 

As per claim 4, the combination of Gulley and Messina teaches the 
claimed invention as described above. However, both Gulley and Messina failed 
to teach that the main processor includes a register operable to store an address 
value pointing to the one or more data words. York, on the other hand, teaches 
that the main processor (the CPU) includes a register, which holds an address 
value pointing to the data words (e.g. see Col. 2, lines 42-51). Accordingly, it 
would have been obvious to one of ordinary skill in the art at the time of the 
current invention was made to modify Gulley and Messina's apparatus by adding 
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an register in the main processor for storing an address value as taught by York 
so the start address within the_memor y to be access ed is ^determ ined. 

As per claims 5 and 6, the combination of Gulley and Messina teaches the 
claimed invention as described above. However, both Gulley and Messina failed 
to teach that at least one coprocessor processing operation includes calculating 
a sum of absolute differences between a plurality of byte values within the one or 
more loaded data words and corresponding ones of a plurality of byte values 
within the one or more locally stored data words. York, on the other hand, 
teaches that one of the coprocessor processing operation (the SUBA instruction) 
calculates sum of differences (e.g. see Col. 36, lines 55-58). Accordingly, it 
would have been obvious to one of ordinary skill in the art at the time of the 
current invention was made to modify Gulley and Messina's apparatus so the 
SUBA instruction can be run as taught by York. In doing so, the sum of 
differences between byte values within loaded and stored data words is 
calculated for the correlation purposes. 

As per claim 7, the combination of Gulley and Messina teaches the 
claimed invention as described above. However, both Gulley and Messina failed 
to teach that the sum of absolute differences is accumulated within an 
accumulate register of the coprocessor. York, on the other hand, teaches that 
sum of differences that calculated by the SUBA instruction is accumulated 
(added) in an accumulate register (the third register) (e.g. see Col. 36, lines 55- 
58). Accordingly, it would have been obvious to one of ordinary skill in the art at 
the time of the current invention was made to modify Gulley and Messina's 



Application/Control Number: 10/042,354 Page 8 

Art Unit: 2186 

apparatus so the sum of differences is accumulated in the accumulate register as 
taught_by York. In doing so, the sum of dif ferences can^be retrieved anytime J>y_ 
the coprocessor for any required manipulation. Since it is stored locally in the 
coprocessor register, coprocessor can retrieve it quickly compare to if it is stored 
elsewhere. 

As per claim 9, the combination of Gulley and Messina teaches the 
claimed invention as described above. However, both Gulley and Messina failed 
to teach that the coprocessor load instruction includes an offset value to be 
added to the address value upon execution. York, on the other hand, teaches 
that the offset value (offset field within the instruction) is used by the CPU to 
specify the changes to be made in the address value provided by the CPU upon 
execution of a particular instruction (e.g. see the abstract and Col. 2, lines 42- 
51 ). Accordingly, it would have been obvious to one of ordinary skill in the art at 
the time of the current invention was made to modify Gulley and Messina's 
apparatus so upon the execution of an instruction, an offset that included in the 
instruction is added to the address value as taught by York. In doing so, the 
actual address is calculated from the given address value by adding an offset to 
that given address. 

6. Claim 8 is rejected under 35 U.S.C. 103(a) as being unpatentable over 
Gulley in view of Messina further in view of Langendorf et al. (USPN: 4,860,197) 
hereinafter, Langendorf. 
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As per claim 8, the combination of Gulley and Messina teaches the 
daimed inv ention as d escribed above^ However , both Gulle y and IVIessina failed 
to teach that the coprocessor includes an alignment register for storing a value 
specifying alignment between the operand data and the one or more loaded data 
words. Langendorf, on the other hand, teaches that the system includes one or 
more memory sets for storing alignment values which represent whether the 
boundary of the instruction with one or more parcels (e.g. see the abstract and 
claim 6). Accordingly, it would have been obvious to one of ordinary skill in the 
art at the time of the current invention was made to implement the alignment 
register for storing alignment value as taught by Langendorf in Gulley and 
Messina's apparatus so the required number of operands are loaded based on 
the alignment value and the execution of the load instruction is started without 
waiting for the remaining operands. Therefore, the number of clock cycles 
required for the execution of the coprocessor load instruction is reduced. 

7. Claim TO is rejected under 35 U.S.C. 103(a) as being unpatentable over 
Gulley in view of Messina further in view of Wu et al. (USPN: 6,418,166) 
hereinafter, Wu. 

As per claim 10, the combination of Gulley and Messina teaches the 
claimed invention as described above. However, both Gulley and Messina failed 
to teach that at least one coprocessor processing operation calculates a sum of 
absolute differences as part of block pixel value matching. Wu, on the other 
hand, teaches that the sum of differences is used as the search criteria in the 
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block matching process (e.g. see Fig. 8 and Col. 4, lines 42-44). Accordingly, it 
would have been_obv ious to on e_ofordinary skill in the art at the time ofjhe 
current invention was made to use the sum of absolute differences as a part of 
block pixel value matching as taught by Wu in Gulley and Messina's apparatus. 
In doing so, it finds a block of pixels that most closely matches the source block 
of pixels. Therefore, it is advantageous. 



Remarks 

8. As to the remark, Applicant asserted that "Gulley fails to disclose that the 
coprocessor (i.e. graphics processor) responsive to a coprocessor load 
instruction on said main processor to load one or more loaded data words into 
said coprocessor and perform at least one coprocessor processing operation 
specified by said coprocessor load instruction using said one or more loaded 
data words to provide operand data to generate at least one result data word. " 
(page 7, paragraph 1). 

Examiner respectfully traverses Applicant's remark for the following 
reasons: 

First of all, Examiner would like to emphasize that as stated in the 
abstract, the invention of Gulley is clearly related to and involved using 
coprocessor to reduce the load from the main processor and therefore, to 
increase the performance of the main processor. Furthermore, as stated in the 
previous Office Action, Gulley teaches that the coprocessor responsive to a 
coprocessor load instruction (an instruction received from the host) on the main 
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processor (the graphics processor in which the instruction comes from the host) 
to loaj one oniiorejoaded data words \_js \ se\ : ofjpperands) into the coprocessor 
and perform at least one coprocessor processing operation (the operation to be 
performed on the operands) specified by the coprocessor load instruction (the 
instruction received from the host) using the one or more loaded data words (the 
set of operands) to provide operand data to generate at least one result data 
word (the result) (e.g. see Col. 2, lines 3-10 and Fig. 1). 



Conclusion 

THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of 
time policy as set forth in 37 CFR 1 .136(a). 

A shortened statutory period for reply to this final action is set to expire 
THREE MONTHS from the mailing date of this action. In the event a first reply is 
filed within TWO MONTHS of the mailing date of this final action and the advisory 
action is not mailed until after the end of the THREE-MONTH shortened statutory 
period, then the shortened statutory period will expire on the date the advisory 
action is mailed, and any extension fee pursuant to 37 CFR 1 .136(a) will be 
calculated from the mailing date of the advisory action. In no event, however, will 
the statutory period for reply expire later than SIX MONTHS from the mailing 
date of this final action. 

Any inquiry concerning this communication or earlier communications from 
the examiner should be directed to Hetul Patel whose telephone number is (703) 
305-6219. The examiner can normally be reached on M-F 8-4:30. 



Application/Control Number: 10/042,354 



Page 12 



Art Unit: 2186 

If attempts to reach the examiner by telephone are unsuccessful, the 
examiner's supervis or, Matt Kim can be re ached on (703) 305-3821 . The fax 
phone number for the organization where this application or proceeding is 
assigned is 703-872-9306. 

Information regarding the status of an application may be obtained from 
the Patent Application Information Retrieval (PAIR) system. Status information 
for published applications may be obtained from either Private PAIR or Public 
PAIR. Status information for unpublished applications is available through 
Private PAIR only. For more information about the PAIR system, see http://pair- 
direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll- 
free). 





